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TECHNICAL FIELD 

The present invention relates to a phospholipase D gene originated from a plant. 
BACKGROUND ART 

Phospholipase D (hereinafter also referred to as "PLD") is one of phospholipid-decomposing enzymes 
which catalyzes, for example, the reaction of decomposing lecithin to liberate phosphatidic acid and choline" 
This enzyme is known to occur in plants, animals and microorganisms. PLD is used as a reagent for 
measunng phospholipids in blood, as well as for hydrolysis of phospholipids and for production of 
derivatives by a base-exchange reaction utilizing the reversible reaction by PLD. 

Purification and partial purification of PLDs originated from plants have been reported That is 
purification of rice PLD (Men Hui Lee, 1989, Phospholipase D of rice bran. I. purification and characteriza- 
tion, Plant Science 59. 25-33); partial purification of rice PLD (Katsumi TAKANO. Ikuzo KAMOl and Tetsujiro 
OBARA, 1987. About separation and purification, and properties of rice bran phospholipase D J Jpn Soc 
Food Sci. Technol.. 34. 8-13 (1987)); purification of peanuts PLD (Michael Heller, Nava Mozes Irena Peri 
and Eddie Maes. 1974, Phospholipase D from peanut seeds. IV. Final purification and some properties of 
the enzyme. Biochem. Biophys. Acta 369. 397-410); and purification of cabbage PLD (Romy Umbrecht and 
Renate Ulbrich-Hofmann, 1992. A facile purification procedure of phospholipase D from cabbage and its 
characterization. Biol. Chem. Hoppeseyler 373(2). 81-88) have been reported. However, amino acid 
sequences of plant PLDs have not been reported at all and genetical analyses thereof have also not been 
reported. 

On the other hand, analyses of PLD genes of microorganisms and animals have been reported 
(Japanese Laid-open Patent Application (Kokai) No. 3-187382; Adrian L. M. Hodgson. Phillip Bird, and Ian T 
Nisbet 1990. Cloning, nucleotide sequence, and expression in Escherichia coli of the phospholipase D 
gene from Corynebacterium pseudotuberculosis. Journal of Bacteriology 172. 1256-1261- and Japanese 
Laid-open Patent Application (Kokai) No. 5-76357). 

If a PLD gene originated from a plant is available, the PLD originated from the plant may be produced 
in a large scale by a genetic engineering process, which is industrially advantageous. However as 
mentioned above, since the amino acid sequence and DNA sequence of plant PLD have not been reported 
It .s hitherto impossible to genetically manipulate the PLD gene. Further, as mentioned above, although the 
ammo acid sequences and DNA sequences of PLDs of microorganisms and animals have been reported 
since homologies of sequences are not observed among the PLD genes of microorganisms or among the 
PLD genes of microorganisms and animals, it is difficult to isolate plant PLD gene based on the reported 
information. Further, since there is no information about plant PLD gene, an antisense DNA which 
suppresses expression of the plant PLD gene has not been obtained. 

DISCLOSURE OF THE INVENTION 

An object of the present invention is to provide a PLD gene originated from a plant. Another object of 
the present invention is to provide an antisense DNA which can suppress the expression of the above- 
nrnC hT! nMA f "k *° P-^^s^"* \n^,e;^^:^or^. Still another object of the present invention is to 

provide a DNA which regulates expression of the PLD gene originated from a plant 

^.^r '"*®"f'''® ^'^'^y- P^®se"t inventors succeeded in isolation of rice PLD gene and in sequencing 
r?K,f r! ^ """"^^'"^ ^"-^ determining the partial amino acid sequence thereof, screening a 

rice cDNA library using as a probe the PGR product obtained by using the oligonucleotides encoding the 
above-mentioned partial amino acid s.)quence. cloning the inserted gene of positive clones and sequencinq 
the inserted gene. Further, the present inventors succeeded in obtaining a cDNA clone of maize PLD using 

Tl^! """^ ^"^ *" ^^•'"^"^'"g the maize PLD gene. The present inventors still 

further succeeded m isolating a genome DNA clone carrying the regulatory sequence of rice PLD gene and 
m sequencing it. » » 

That is. the present invention provides a DNA encoding phospholipase D originated from a plant The 

f^romTplanr^'°" ^ ^^"^ ^^""^ regulates the expression of phospholipase D gene originated 

r^MA^^ the present invention, a PLD gene originated from a plant was cloned for the first time. By using the 
DNA according to the present invention, the PLD originated from a plant, which is industrially useful can be 
produced in a large scale by a genetic engineering process. Farther, by the present invention, a DNA which 
regulates expression of the PLD gene originated from a plant was cloned for the first time By virtue of this 
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expression of lipid-related genes may be suppressed, thereby modifying plant lipids. 
BEST MODE FOR CARRYING OUT THE INVENTION 

As mentioned above, the DNA according to the present invention encodes PLD originated from a plant. 
In the examples hereinbelow described, the DNA encoding rice PLD was isolated and sequenced. The 
deduced amino acid sequence encoded by the thus sequenced DNA is shown in SEQ ID NO. 2 in the 
Sequence Usting. The experimentally determined nucleotide sequence encoding the amino acid sequence 
shown in SEQ ID NO. 2 is shown in SEQ ID NO. 1. Therefore, the amino acid sequence shown in SEQ ID 
NO. 1 is identical to the amino acid sequence shown in SEQ ID NO. 2. Further, in the examples 
hereinbelow described, a cDNA clone of maize PLD gene was isolated using as a probe the thus 
sequenced DNA encoding the rice PLD and the maize PLD gene was sequenced- The deduced amino acid 
sequence encoded by the thus sequenced DNA is shown in SEQ ID NO. 4 in the Sequence Listing. The 
experimentally determined nucleotide sequence encoding the amino acid sequence shown in SEQ ID NO. 4 
is shown in SEQ ID NO. 3. Therefore, the amino acid sequence shown in SEQ ID NO. 3 is identical to the 
amino acid sequence shown in SEQ ID NO. 4. 

Further, in the examples described below, rice cDNA library was screened using as a probe the thus 
obtained DNA encoding rice PLD and positive clones were isolated, followed by sequencing the inserted 
DNA to obtain a clone carrying an inserted DNA having the nucleotide sequence shown in SEQ ID NO. 5. In 
SEQ ID NO- 5. the 1876th base is "A" which is the first "A" in the translation initiation codon ATG of the 
PLD structural gene, and the region upstream of the "A" (i.e.. 1-1875nt) is thought to be a regulatory region 
of the PLD gene. The amino acid sequence encoded by the region downstream of 1876nt is interrupted by 
intron. 

The DNA according to the present invention may be obtained by. for example, the method described in 
the examples below in which rice or maize is used as the starting material. Alternatively. • since the 
nucleotide sequence of the gene was disclosed by the present invention, the PLD gene may easily be 
prepared by PGR method using as primers a pair of oligonucleotides each of which hybridizes with the 
respective end of the DNA according to the present invention and using the genome DNA of rice or maize 
as a template. 

A plant producing PLD may be obtained by inserting the DNA encoding PLD originated from a plant 
according to the present invention into a vector for plants by a conventional method and transforming a 
plant with the obtained recombinant vector by a conventional method. 

By inserting the DNA according to the present invention into a vector for plants in the reverse direction, 
an antisense DNA of PLD gene may be obtained. By transforming a plant with such a recombinant vector, 
the mRNA which hybridizes with the mRNA formed by transcription of the PLD gene intrinsic to the plant is 
produced, so that expression of the PLD gene intrinsic to the plant may be suppressed. If this technique Is 
applied to a rice plant, a rice plant in which expression of PLD gene is suppressed is obtained, so that the 
taste of the rice is improved. 

The DNA regulating the PLD gene originated from a plant according to the present invention may be 
used in combination with the antisense DNA of the PLD gene. By this, the antisense DNA can be expressed 
at the same place as the place in which the PLD gene intrinsic to the plant is expressed. 

It is well-known in the art that there are cases wherein the activity of an enzyme is retained even if the 
amino acid sequence of an enzyme is modified to a small extent, that is. even if one or more amino acids in 
the amino acid sequence are substituted or deleted, or even if one or more amino acids are added to the 
amino acid sequence. DNAs encoding the proteins having such modifications and having PLD activity are 
included within the scope of the present invention. That is. cloned DNAs encoding amino acid sequences 
having the same amino acid sequence as SEQ ID NO. 2 or SEQ ID NO. 4 except that one or more amino 
acids are substituted, deleted or added, which give the enzyme activity of PLD, are also included in the 
scope of the present invention. Similarly. DNAs having the same nucleotide sequence as SEQ ID NO. 5 
except that one or more nucleotides are substituted, deleted or added, which regulate expression of the 
DNA encoding the amino acid sequence giving the enzyme activity of PLD are also included within the 
scope of the present invention. 

Modification of DNA which brings about addition, deletion or substitution of the amino acid sequence 
encoded thereby can be attained by the site-specific mutagenesis which is well-known in the art (e.g.. 
Nucleic Acid Research, Vol. 10. No. 20, p6487-6500. 1982). In the present specification, "one or more 
amino acids" means the number of amino acids which can be added, deleted or substituted by the site- 
specific mutagenesis. 
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Site-specific mutagenesis may be carried out by, for example, using a synthetic oligonucleotide primer 
complementary to a single-stranded phage DNA except that the desired mutation as follows. That is, using 
the above-mentioned synthetic oligonucleotide as a primer, a complementary chain is produced by a 
phage, and host bacterial cells are transformed with the obtained double-stranded DNA. The culture of the 

5 transformed bacterial cells is plated on agar and plaques are formed from a single cell containing the 
phage. Theoretically, 50% of the new colonies contain the phage having a single-stranded chain carrying 
the mutation and remaining 50% of the colonies contain the phage having the original sequence. The 
obtained plaques are then subjected to hybridization with a kinase-treated synthetic probe at a temperature 
at which the probe is hybridized with the DNA having exactly the same sequence as the DNA having the 

fo desired mutation but not with the original DNA sequence that is not completely complementary with the 
probe. Then the plaques in which the hybridization was observed are picked up, cultured and the DNA is 
collected. 

In addition to the above-mentioned site-specific mutagenesis, the methods for substituting, deleting or 
adding one or more amino acids without losing the enzyme activity include a method in which the gene is 
5 treated with a mutagen and a method in which the gene is selectively cleaved, a selected nucleotide is 
removed, added or substituted and then the gene is ligated. 

Examples 



20 The present invention will now be described more concretely by way of examples. However, the 
present invention is not restricted to the examples described below. 

1 . Purification of rice bran PLD 



25 For the purification, a reference relating to the purification of rice bran PLD (Takano et al., J. Jpn. Soc. 
Food Sci. Technol.. 34, 8-13 (1987)) was referred. The enzyme activity was measured by using 
phosphatidylcholine as the substrate and by quantifying the choline generated by the enzyme reaction by 
the choline oxidase method (Imamura et al., J. Biochem. 83, 677-680 (1978)). The enzyme reaction, 
however, was stopped by heat treatment at 95 * C for 5 minutes. 

30 That is, one liter of hexane was added to 100 g of rice bran of Oryza sativa variety "Koshihikari" and 
the mixture was stirred overnight to defat the rice bran. Then 10 g of POLYCRAL AT (trademark: 
polyvinylpyrrolidone commercially available from GAF Chemical) and 500 ml of 10 mM Tris-HCI buffer (pH 
7) containing 1 mM CaCb and 5 mM 2-mercaptoethanol were added and the resulting mixture was stirred 
for one hour to extract enzyme. The extract was filtered through 8-ply gauze and the filtrate was centrifuged 

35 at 15.000 X g for 20 minutes. The intermediate layer was collected as a crude extract. The crude extract 
was treated with ammonium sulfate (65% saturation) and formed precipitates were collected by centrifuga- 
tion (15,000 X g. 20 minutes). The precipitates were dissolved and dialyzed against the above-described 
buffer. After the dialysis, precipitates were removed by centrifugation to obtain a ammonium sulfate fraction. 
The ammonium sulfate fraction was applied to a DEAE-Cellulose (commercially available from Whatt- 

40 man) column (2.0 x 10 cm) equilibrated with buffer A (10 mM Tris-HCI pH 7, 1 mM CaCb. 1 mM 2- 
mercaptoethanol). After washing the column with about 100 ml of buffer A containing 50 mM NaCI. elution 
was performed with 120 ml of buffer A having linear gradient of NaCI from 50 to 350 mM. PLD was eluted 
in the vicinity of 0.2 M of NaCI. The fractions exhibiting PLD activity were collected as an eluted solution 
(DEAE-Cellulose). 

45 To the eluted solution (DEAE-Cellulose), 3 M of ammonium sulfate was added to a concentration of 1 M 
and the resultant was applied to a Phenyl Sepharose column (commercially available from PHARMACIA. 2.6 
X 10 cm). Then the column was subjected to elution with 240 ml of buffer A having a gradient of ammoriium 
sulfate from 1.0 to 0 M. PLD was eluted in the vicinity of 0.1 M of ammonium sulfate. Fractions exhibiting 
PLD activity were collected and dialyzed against buffer A to obtain an eluted solution (Phenyl Sepharose). 

50 The eluted solution (Phenyl Sepharose) was applied to a Mono Q column (anion-exchange column 
commercially available from PHARMACIA, 16 x 10 cm) and then elution was carried out with 150 ml of 
buffer A having a gradient of NaCI from 50 - 350 mM. PLD was eluted at NaCI concentrations from 210 mM 
to 235 mM. The fractions exhibiting PLD activity were collected and dialyzed against buffer A to obtain an 
eluted solution (Mono Q 1st). 

55 The eluted solution (Mono Q 1st) was concentrated to 0.5 ml by centrifugal ultrafiltration and the 
resultant was applied to Superose 6 column (commercially available from PHARMACIA. 1.0 x 30 cm) 
equilibrated with buffer A containing 0.1 M NaCI. and the column was subjected to elutiori with the same 
buffer. The molecular weight of the PLD was estimated as 78 kDa. The fractions exhibiting PLD activity 
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were collected as an eluted solution (Superose 6). 

To the eluted solution (Superose 6). 2.5 ml of 40% CARRIER AMPHOLITE (commercially available from 
PHARMACIA, pH 4.0-6.0) and distilled water were added to a final volume of 50 mL The resultant was then 
subjected to isoelectric focusing using Rotofore (commercially available from BIORAD). The electrophoresis 
5 was carried out at 2*0 under a constant power of 12W for 4 hours. PLD activity was observed in the 
vicinity of pH4.9- The fractions exhibiting PLD activity were collected and the obtained solution was dialyzed 
against buffer A to obtain an isoelectric focusing fraction. 

The isoelectric focusing fraction was applied to a Mono Q column (commercially available from 
PHARMACIA, 0.5 x 5 cm) and elution was carried out using buffer A having a linear gradient of NaCI from 
70 50 mM to 350 mM. PLD was eluted in the vicinity of 210 mM and 235 mM NaCI. The two fractions 
exhibiting PLD activity were collected as eluted solutions (Mono Q 2nd-l and Mono Q 2nd-ll). 

Purities of the eluted solutions (Mono Q 2nd-l and Mono Q 2nd-ll) were determined by SDS- 
polyacrylamide gel electrophoresis (Laemmli (1970)). After the electrophoresis, the gels were stained with 
COOMASSIE BRILLIANT BLUE R250. With either eluted solution, a main band was observed at the position 
75 of molecular weight of 82 kDa. A single band was observed with eluted solution (Mono Q 2nd-ll). 

By the above-described purification, eluted solutions (Mono Q 2nd-l and Mono Q 2nd-ll) were 380-fold 
and 760-fold purified, respectively, based on the crude extract. 

The two fractions were analyzed for properties of the enzymes contained therein. The results are shown 
in Table 1. The buffer solutions used for measuring the optimum pH were sodium acetate (pH4-6). MES- 
20 NaOH (pH5.5-7.0) and Tris-HCI (pH7-9). all of which had a concentration of 100 mM. The pH stability 
means the range in which decrease in activity was not observed after leaving the enzyme to stand at 25»C 
for 30 minutes. The temperature stability was measured by measuring remaining activity after leaving the 
enzyme to stand at 4 • C, 25 'C. 37' C or 50* C for 30 minutes. The substrate specificity was measured at a 
substrate concentration of 5 mM and is expressed as a relative activity taking the enzyme activity to 
25 phosphatidylcholine as 100. 



Table 1 



30 




Mono Q 2nd-l 


Mono Q 2nd-ll 


35 


Km Value 
Optimum pH 
pH Stability 
Temperature Stability 
Ca2+ Dependency 


0.29 mM 
6 

7-8 
4-37 -C 
20 mM or more 


0.29 mM 
6 

7-8 
4-37 • C 
20 mM or more 




Substrate Specificity 






40 


Phosphatidylcholine 

Lysophosphatidylcholine 

Sphingomyelin 


100 
13 
6 


100 
12 
4 



2. Proof that Purified Protein is PLD 

In the same manner as in the determination of the purity, eluted solutions (Mono Q 2nd-l and Mono Q 
2nd-ll) were separately subjected to SDS-polyacrylamide gel electrophoresis and each of the gels was 
transcribed to a PVDF membrane (commercially available from MILLIPORE). followed by staining the 
membrane. The band corresponding to the protein having a molecular weight of 82 kOa was cut out and 
amino acid sequence of the N-terminal region of the protein was determined. Amino acid sequence up to 
10th amino acid residue was able to be determined for both proteins, and both proteins had the same 
sequence as follows: 
Val Gly Lys Gly Ala Thr Lys Val Tyr Ser 

Although the relationship between the proteins of 82 kDa which existed in two fractions having the 
enzyme activity was unknown, at least the homology of the amino acid sequences thereof was thought to 
be high. Thus, it was thought that there was no problem even if a mixture of these fractions was used as an 
antigen for producing an antibody. 

A mixture of the eluted solutions (Mono Q 2nd-l and Mono Q 2nd-ll) was subjected to SDS- 
polyacrylamide gel electrophoresis employing 7.5% acrylamide and the gel was stained with COOMASSIE 
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BRILLIANT BLUE R250. The band containing the protein of 82 kDa was cut out and the proteins were 
recovered by eiectroelution (25 mM Trrs, 192 mM glycine, 0.025% SDS, 100V. 10 hours). After removing 
SDS by etectrodialysis (15 nnM ammonium bicarbonate, 200V, 5 hours), the resultant was freeze-dried. The 
eiectroelution and electrophoresis were performed by using BIOTRAP (commercially available from 
SCHLEICHER & SCHUELL). 

Rabbits were Immunized with the protein of 82 kDa highly purified by the above-described method. 
Immunization was performed by administering 50 ug of the protein per time at 7 days' interval. Using blood 
serum before the immunization and after the third immunization, immunotitration was performed. That is, 
PLD solution containing 8.6 x lO-^ units of the enzyme, 0 to 50 ul of the serum before the immunization or 
after the third immunization. 50 ul of 250 mM Tris-HCI (pH 7.0), 5 ul of 50 mM CaCb, 50 ti\ of 0.2% 
TRITON X-100 (trademark) and balance of water were mixed to a final volume of 250 ul and the mixture 
was left to stand at room temperature for 2.5 hours. To the resulting mixture, 200 ul of Protein A 
SEPHAROSE (commercially available from PHARMACIA) was added and the resultant was gently shaken at 
room temperature for 2 hours. The resultant was centrifuged (500 x g, 5 minutes) and enzyme activity of 
the supernatant was measured. Taking the enzyme activity when no serum was added as 100%, the 
enzyme activities measured when 20 ul or 50 ul of the serum before the immunization were 95% and 88%. 
respectively, while the enzyme activities measured when 20 ul or 50 ul of the serum after the third 
immunization were 75% and 30%. respectively. These results proved that the protein of 82 kDa is PLD. 

3. Determination of Internal Amino Acid Sequence 

The PLD protein was fragmented by fragmenting the protein in a gel (Cleveland et aL, J. Biol. Chem.. 
252. 1102(1977)). The gel containing the PLD protein, which was cut out by the same method as described 
in 2 was inserted in a stacking gel well on a 15% acrylamide gel prepared for separation of peptides. 
Staphylococcus aureus V8 protease (commercially available from WAKO PURE CHEMICAL INDUSTRIES, 
LTD) in an amount of 1/10 of the PLD protein was overlaid and electrophoresis was started. The 
electrophoresis was interrupted when bromophenol blue reached the center of the stacking gel, and 30 
minutes after, the electrophoresis was restarted. After the electrophoresis, the gel was transcribed to a 
PVDF membrane and the PVDF membrane was stained. Clear bands were observed at positions cor- 
responding to molecular weights of 20. 14, 13, 11 and 10 kDa. The bands corresponding to molecular 
weights of 20. 14 and 13 kDa were cut out and amino acid sequences of the peptide fragments contained in 
the bands were determined by a protein sequencer. The sequences are as follows: 



20 kDa: 


Asn 


Tyr 


Phe 


His Gly Ser 


Asp 


Val 


Asn 


7 


Val 


Leu 


? Pro 


Arg 


Asn 


Pro 


Asp Asp (Asp) 


7 


7 


He 








14 kDa: 


Thr 




Asn 


Val Gin Leu 


Phe 


Arg 


Ser 


He 


Asp 


Gly 


Gly Ala 


Ala 


Phe 


Gly 


Phe Pro Asp 


Thr 


Pro 


Glu 


Glu 


Ala 


Ala 


Lys ? 


Gly 


Leu 


Val 


Ser Gly 














13 kDa: 


lie 


Ala 


Met 


Gly Gly Tyr 


Gin 


Phe 


Tyr 


His 


Leu 


Ala 


Thr Arg 


Gin 


Pro 


Ala 


Arg Gly Gin 


He 


His 


Gly 


Phe 


Arg Met 


Ala Leu 


7 


Tyr 


Glu 


His Leu Gly 


Met 


Leu 


7 


Asp 


Val 


Phe 



(In the sequences. means the residue which could not be identified and the amino acid residue in 
parenthesis is one which may be another amino acid residue with a considerable probability.) 
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4. Preparation of cDNA Library of Rice Immature Seeds 

Total RNAs were prepared from immature seeds 5 days after blossom by extracting RNAs by the SDS- 
phenol method and by precipitating the extract with lithium chloride. Poly(A)+RNAs were prepared using 
OLtGOTEX-dT30 (commercially available from TAKARA SHUZO) in accordance with the instructions by the 
manufacturer. For cDNA cloning, cDNA SYNTHESIS SYSTEM PLUS (commercially available from AMER- 
SHAM) and cDNA CLONING SYSTEM XgttO (commercially available from AMERSHAM) were used. As the 
cloning vector, XZAPII vector (commercially available from STRATAGENE) was used and as the host cells, 
XL1-Blue was used. 

5. Preparation of Probes 

Oligonucleotides corresponding to the amino acid sequence of PLD were synthesized by a DNA 
synthesizer (commercially available from APPLIED BlOSYSTEMS). The sequences thereof and the amino 
acid sequences corresponding thereto are described below. 

20KF: 5* AAYTAYTTYCAYGG 3' 

20KR1 : 5* RTCRTCRTCNGGRTT 3' 
(wherein R represents purine bases, that is. A or G; Y represents pyrimidine bases, that is, T or C; and N 
represents G, A, T or C.) 

20KF is a mixture of 32 types of oligonucleotides each of which encodes the amino acid sequence of 
Asn Tyr Phe His Gly 

that was found in the peptide having a molecular weight of 20 kDa, and 20KR1 is a mixture of 128 types of 
oligonucleotides each of which encodes the amino acid sequence of 

Asn Pro Asp Asp(Asp) 
that was found in the same peptide. 

The cDNA synthesis reaction was carried out in a mixture of 10 ng of Poly(A)+RNA, 0.3 UQ of random 
hexamer (dN6). 10 U of RNase inhibitor (RNAGuard. commercially available from PHARMACIA). 1 mM each 
of dATP, dCTP. dGTP and dTTP. 1 x PGR buffer (commercially available from TAKAKA SHUZO), 50 mM of 
magnesium chloride and 100 U of reverse transcriptase (M-MuLV RTase. commercially available from 
BRL), the total volume of the reaction mixture being 10 uL The reaction was carried out at 37 'C for 30 
minutes and the mixture was then heated at 95*C for 5 minutes, followed by retaining the resulting mixture 
in Ice. 

Polymerase chain reaction (PGR) was performed using the above-described cDNA as a template, and 
20KF and 20KR1 as primers. The reaction was carried out using 10 ul of the cDNA synthesis reaction 
mixture, a mixture of 50 pmol each of the primers, 200 jllM each of dATP, dCTP, dGTP and dTTP. 1 x PGR 
buffer (commercially available from TAKARA SHUZO) and 2.5 U of AmpliTaq DNA polymerase (commer- 
cially available from TAKARA SHUZO). the total volume of the reaction mixture being 50 ul. A cycle of 
94 -C for 1 minute/40 'C for 1 minute/72 •C for 2.5 minutes was repeated 30 times in DNA THER- 
MOCYCLER (commercially available from PERKIN ELMER CETUS). 

PGR product was separated on 2% agarose gel. Several fragments were detected by staining the gel 
with ethldium bromide. One of them had a size of 94 bp which Is the expected size. 

The PGR fragment was cut out from the gel and subcloned into pUCl9 plasmld. The subcloned PGR 
fragment was sequenced by the dideoxy metiiod using T7 sequencing kit (commercially available from 
PHARMACIA). Between the two primers, a nucleotide sequence encoding the expected amino acid 
sequence was observed. The nucleotide sequence between the two primers and the amino acid sequence 
encoded thereby are as follows: 

C TCT GAG GTG AAC TGT GTT CTA TGC GOT CGC 
Ser Asp Val Asn Cys Val Leu Cys Pro Arg 

Isotope (commercially available from AMERSHAM) was Incorporated into the above-described 
oligonucleotide using DNA 5*-end labelling kit MEGALABEL (commercially available from TAKAKA SHUZO) 
to obtain a radioactive oligonucleotide probe. 
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6. Screening of PLD Gene-containing Clone 

Using the above-described radioactive oligonucleotide as a probe, a cDNA library was screened. The 
hybridization solution was 0.5 M sodium phosphate buffer (pH 7.2) containing 7% SDS. 1 mM EDTA and 

5 100 ug/ml of salmon sperm DNA. and hybridization was carried out at 45-C for 16 hours after adding the 
probe to this solution. The washing solution contained 0.3 M NaCI and 30 mM sodium citrate and washing 
was performed twice at 45 • C for 20 minutes. Positive plaques were isolated and subcloned in vivo into 
pBluescript plasmid (commercially available from STRATAGENE) in accordance with the instruction of the 
manufacturer of XZAPII cloning vector. The nucleotide sequence was determined and the region encoding 

10 the internal amino acid sequence determined in 3 existed. 

7. Sequencing of 5'-end Region 

Since a clone containing the full-length cDNA could not be isolated, a DNA fragment containing the 5'- 
75 end region was prepared by RACE method (Edwards et al.. Nucleic Acids Res., 19, 5227-5232(1991)). 5'- 
AmpliFINDER RACE Kit (commercially available from CLONETECH) was used in accordance with the 
instructions attached to the product- OligoDNAs were prepared and PCR was performed using the mRNA 
prepared by the method described in 4 as a template. The PCR product was subcloned into PCRII vector 
(commercially available from INVITROGEN) and sequenced by the dideoxy method. As a result, it was 
20 estimated that translation is initiated from the 182th nucleotide shown in SEQ ID NO. 1 because a 
termination codon exists upstream thereof by 36 bp. 

Preparation of cDNA Clone Encoding PLD Originated from Maize 

25 Maize cDNA clone was obtained by the method described below using the DNA encoding rice PLD as 

a probe. 

Using suspended cultured cells established by culturing a callus derived from immature embryo of 
maize mbred Mo 17 (commercially available from MIKE BRAYTON SEEDS, INC.) in a liquid culture 
medium, a cDNA library was prepared by the method described in 4. However. XgtIO vector (commercially 

30 available from AMERSHAM) was used as the cloning vector and NM-514 (commercially available from 
AMERSHAM) was used as the host cells. Using the cDNA of rice PLD as a probe, positive plaques were 
isolated by the method described in 6. Phage DNA was prepared in accordance with the instructions by the 
manufacturer of the cloning vector. The phage DNA was digested with a restriction enzyme Kpn I and the 
resultant was subcloned into pBluescript plasmid. The nucleotide sequence was determined by the dideoxy 

35 method as described in 6. 

®' 'so'ation of PLD Genome Clone Corresponding to PLD cDNA and Identification of Promoter Region 

To isolate a genome DNA clone carrying the regulatory sequence of the PLD gene corresponding to the 

40 PLD cDNA sequenced in 6, which was cloned into pBluescript plasmid, a genome library of rice Koshihikari 
was prepared. This was carried out by partially digesting DNAs from leaves of Koshihikari with Mbo I 
purifying fractions having sizes of 16-20 kb by sucrose gradient centrifugation, and by using lambda DASH 
n (commercially available from STRATAGENE) and Gigapackll Gold (commercially available from 
STRATAGENE). Using the PLD cDNA clone as a probe, the genome library was screened. The screening 

45 was performed as described in 6. However, the hybridization was performed at 65-C for 16 hours the 
washing solution was 0.5 x SSC containing 0.1% SDS and washing was performed twice at 65-C for 20 
minutes. The nucleotide sequence of the hybridized genome clone was determined by the dideoxy method. 
As a result, a region homologous to the cDNA sequence determined in 6 existed. 

The transcription initiation site was determined by the method described in 7. In the vicinity of the 

50 transcription initiation site. "TATA" consensus sequence box was observed. The ATG translation initiation 
site was determined as the upstream most ATG codon in the translation reading frame of the clone and as 
the ATG codon which is first accessible in the mRNA synthesized in rice. 

A part of the DNA sequence of the genome clone hybridized with the cDNA clone is shown in SEQ ID 
NO. 5. In the genome DNA sequence, a reading frame starting from the ATG translation initiation codon 

55 which overlaps with the corresponding cDNA sequence was identified. The promoter region is located 
upstream of the ATG translation initiation codon and starts immediately upstream thereof. 
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SEQ ID NO. :1 
SEQUENCE LENGTH: 3040 

5 

SEQUENCE TYPE: NUCLEIC ACID 
MOLECULE TYPE: cDNA to mRNA 
10 ORIGINAL SOURCE 

ORGANISM: Oryza sativa 
SEQUENCE DESCRIPTION 

AGTCTCTCTT CTCCCGCAAT TTTATAATCT CGATCGATCC AATCTGCTCC CCTTCTTCTT 60 
CTACTCTCCC CATCTCGGCT CTCGCCATCG CCATCCTCCT CTCCCTTCCC GGAGAAGACG 12 0 
CCTCCCTCCG CCGATCACCA CCCGGTAGGG CGAGGAGGGA GCCAAATCCA AATCAGCAGC 180 
C ATG GCG CAG ATG CTG CTC CAT GGG ACG CTG CAC GCC ACC ATC TTC GAG 22 9 
Met Ala Gin Met Leu Leu His Gly Thr Leu His Ala Thr lie Phe Glu 
25 1 5 10 15 

GCG GCG TCG CTC TCC AAC CCG CAC CGC GCC AGC GGA AGC GCC CCC AAG 277 
Ala Ala Ser Leu Ser Asn Pro His Arg Ala Ser Gly Ser Ala Pro Lys 

20 25 30 

TTC ATC CGC AAG TTT GTG GAG GGG ATT GAG GAC ACT GTG GGT GTC GGC 325 
Phe lie Arg Lys Phe Val Glu Gly He Glu Asp Thr Val Gly Vai Gly 

3S 40 45 

AAA GGC GCC ACC AAG GTG TAT TCT ACC ATT GAT CTG GAG AAA GCT CGT 373 
Lys Gly Ala Thr Lys Val Tyr Ser Thr He Asp Leu Glu Lys Ala Arg 

50 55 60 

GTA GGG CGA ACT AGG ATG ATA ACC AAT GAG CCC ATC AAC CCT CGC TGG 421 
Val Gly Arg Thr Arg Met He Thr Asn Glu Pro He Asn Pro Arg Trp 
" 70 75 80 

TAT GAG TCG TTC CAC ATC TAT TGC GCT CAT ATG GCT TCC AAT GTG ATC 469 
Tyr Glu Ser Phe His He Tyr Cys Ala His Met Ala Ser Asn Val He 



30 



35 



55 



75 



20 
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85 90 95 

TTC ACT GTC AAG ATT GAT AAC CCT ATT GGG GCA ACG AAT ATT GGG AGG 517 
Phe Thr Val Lys lie Asp Asn Pro He Gly Ala Thr Asn He Gly Arg 

105 110 
GCT TAG CTG CCT GTC CAA GAG CTT CTC AAT GGA GAG GAG ATT GAC AGA 565 
Ala Tyr Leu Pro Val Gin Glu Leu Leu Asn Gly Glu Glu He Asp Arg 

120 125 
TGG CTC GAT ATC TGT GAT AAT AAC CGC GAG TCT GTT GGT GAG AGC AAG 613 
Trp Leu Asp He Cys Asp Asn Asn Arg Glu Ser Val Gly Glu Ser Lys 

135 140 
ATC CAT GTG AAG CTT GAG TAC TTC GAT GTT TCC AAG GAT CGC AAT TGG 661 
He His Val Lys Leu Gin Tyr Phe Asp Val Ser Lys Asp Arg Asn Trp 
1^5 ISO 155 160 

GCG AGG GGT GTC CGC AGT ACC AAG TAT CCA GGT GTT CCT TAC ACC TTC 709 
Ala Arg Gly Val Arg Ser Thr Lys Tyr Pro Gly Val Pro Tyr Thr Phe 

1^5 170 175 

TTC TCT CAG AGG CAA GGG TGC AAA GTT ACC TTG TAC CAA GAT GCT CAT 757 
Phe ser Gin Arg Gin Gly cys Lys Val Thr Leu Tyr Gin Asp 7U.a His 

10^> 185 190 

GTC CCA GAC AAC TTC ATT CCA AAG ATT CCG CTT GCC GAT GGC AAG AAT 805 
Val Pro Asp Asn Phe He Pro Lys He Pro Leu Ala Asp Gly Lys Asn 

195 200 205 

TAT GAA CCC CAC AGA TGC TGG GAG GAT ATC TTT GAT GCT ATA AGC AAT 853 
Tyr Glu Pro His Arg Cys Trp Glu Asp He Phe Asp Ala He Ser Asn 

210 215 220 

GCT CAA CAT TTG ATT TAC ATC ACT GGC TGG TCT GTA TAC ACT GAG ATC 901 
Ala Gin His Leu He Tyr He Thr Gly Trp Ser Val Tyr Thr Glu He 



25 



30 



35 



45 



50 



55 



10 



70 



75 



20 



25 



35 



40 



45 



SO 



BP 0 685 559 A1 

^" 230 

ACC TTG GTT AGG GAC TCC AAT CGT CCA AAA CCT GOA GGG GAT GTC ACC 
Thr Leu Val Arg Asp Ser Asn Arg Pro Lys Pro Gly Gly Asp Val Thr 

245 250 255 

CTT GGG GAG TTG CTC AAG AAG AAG GCC AGT GAA GGT GTT CGG GTC CTC 
Leu Gly Glu Leu Leu Lys Lys Lys Ala Ser Glu Gly Val Arg Val Leu 

260 265 270 

ATG CTT GTG TGG GAT GAC AGG ACT TCA GTT GGT TTG CTA AAG AGG GAT 
Met Leu Val Trp Asp Asp Arg Thr Ser Val Gly Leu Leu Lys Arg Asp 



AGC ATT GTT CAG GAT CTG TCG ATC TCA ACT ATG TTT ACA CAC CAT CAG 
Ser He Val Gin Asp Leu Ser He Ser Thr Met Phe Thr His His Gin 

325 

AAG ATA GTA GTT GTT GAC CAT GAG TTG CCA AAC CAG GGC TCC CAA CAA 
Lys He val Val Val Asp His Glu Leu Pro Asn Gin Gly Ser Gin Gin 

345 

AGG AGG ATA GTC AGT TTC GTT GGT GGC CTT GAT CTC TGT GAT GGA AGG 
Arg Arg He Val Ser Phe Val Gly Gly Leu Asp Leu Cys Asp Gly Arg 

360 365 
TAT GAC ACT CAG TAC CAT TCT TTG TTT AGG ACA CTC GAC AGT ACC CAT 
Tyr Asp Thr Gin Tyr His Ser Leu Phe Arg Thr Leu Asp Ser Thr His 



949 



997 



1045 



275 280 285 

GGC TTG ATG GCA ACA CAT GAT GAG GAA ACT GAA AAT TAC TTC CAT GGC 1093 
Gly Leu Met Ala Thr His Asp Glu Glu Thr Glu Asn Tyr Phe His Gly 

295 300 
TCT GAC GTG AAC TGT GTT CTA TGC CCT CGC AAC CCT GAT GAC TCA GGC 
Ser Asp Val Asn Cys Val Leu Cys Pro Arg Asn Pro Asp Asp Ser Gly 



1141 



1189 



1237 



1285 



1333 
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380 



CAT OAT GAC TTC CAC CAG CCA AAC TTT GCC ACT GCA TCA ATC AAA AAG 
His ASP Asp Phe His Gin Pro Asn Phe Ala Thr Ala Ser He Lys Lys 

GGT GGA CCT AGA GAG CCA TGG CAT GAT ATT CAC TCA CGG CTG GAA GGG 
Gly Gly Pro Arg Glu Pro Trp His Asp He His Ser Arg Leu Glu Gly 



1381 



1429 



403 



410 



415 



CCA ATC GCA TGG GAT GTT CTT TAC AAT TTC GAG CAG AGA TGG AGA AAG 
Pro lie Ala Trp Asp Val Leu Tyr Asn Phe Glu Gin Arg Trp Arg Lys 

^25 

CAG GGT GGT AAG GAT CTC CTT CTG CAG CTC AGG GAT CTG TCT GAC ACT 



435 



25 " ^''O 445 

ATT ATT CCA CCT TCT CCT GTT ATG TTT CCA GAG GAC AGA GAA ACA TGG 
lie lie Pro Pro Ser Pro Val Met Phe Pro Glu Asp Arg Glu Thr Trp 

455 

AAT GTT CAG CTA TTT AGA TCC ATT GAT GGT GGT GCT GCT TTT GOG TTC 
Asn val Gin Leu Phe Arg Ser He Asp Gly Gly Ala Ala Phe Gly Phe 



465 



470 



475 



480 



CCT GAT ACC CCT GAG GAG GCT GCA AAA GCT GGG CTT GTA AGC GOA AAG 
Pro ASP Thr Pro Glu Glu Ala Ala Lys Ala Gly Leu Val Ser Gly Lys 



485 



490 



495 



GAT CAA ATC ATT GAC AGG AGC ATC CAG GAT GCA TAC ATA CAT GCC ATC 
ASP Gin He He Asp Arg Ser He Gin Asp Ala Tyr He His Ala He 

CGG AGG GCA AAG AAC TTC ATC TAT ATA GAG AAC CAA TAC TTC CTT GGA 
Arg Arg Ala Lys Asn Phe He Tyr He Glu A.n Gin Tyr Phe Leu Gly 



1477 



1525 



1573 



1621 
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1717 
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40 



45 



50 



515 520 525 

AGT TCC TAT GCC TGG AAA CCC GAG GGC ATC AAG CCT GAA GAC ATT GGT X813 

5 

Ser Ser Tyr Ala Trp Lys Pro Glu Gly lie Lys Pro Glu Asp lie Gly 

530 535 
GCC CTG CAT TTG ATT CCT AAG GAG CTT GCA CTG AAA GTT GTC AGT AAG 1861 
Ala Leu His Leu He Pro Lys Glu Leu Ala Leu Lys Val Val Ser Lys 
^""^ 550 555 560 

75 ATT GAA GCC GGG GAA CGG TTC ACT GTT TAT GTT GTG GTG CCA ATG TGG 1909 

He Glu Ala Gly Glu Arg Phe Thr Val Tyr Val Val Val Pro Met Trp 
565 570 575 

20 

CCT GAG GGT GTT CCA GAG AGT GGA TCT GTT CAG GCA ATC CTG GAC TGG 1957 
Pro Glu Gly Val Pro Glu Ser Gly Ser Val Gin Ala He Leu Asp Trp 
25 580 585 590 

CAA AGG AGR ACA ATG GAG ATG ATG TAC ACT GAC ATT ACA GAG GCT CTC 2005 
Gin Arg Arg Thr Met Glu Met Met Tyr Thr Asp He Thr Glu Ala Leu 

600 605 
CAA GCC AAG GGA ATT GAA GCG AAC CCC AAG GAC TAC CTC ACT TTC TTC 2053 
Gin Ala Lys Gly He Glu Ala Asn Pro Lys Asp Tyr Leu Thr Phe Phe 

615 620 
TGC TTG GGT AAC CGT GAG GTG AAG CAG GCT GGG GAA TAT CAG CCT GAA 2101 
Cys Leu Gly Asn Arg Glu Val Lys Gin Ala Gly Glu Tyr Gin Pro Glu 

630 635 640 

GAA CAA CCA GAA GCT GAC ACT GAT TAC AGC GGA GCT CAG GAA GCT AGG 2149 
Glu Gin Pro Glu Ala Asp Thr Asp Tyr Ser Arg Ala Gin Glu Ala Arg 

645 650 655 

AGG TTC ATG ATC TAT GTC CAC ACC AAA ATG ATG ATA GTT GAC GAT GAG 2197 
Arg Phe Met He Tyr Val His Thr Lys Met Met He Val Asp Asp Glu 
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660 



665 



670 



TAC ATC ATC ATC OCT TCT GCA AAC ATC AAC CAG AGO TCG ATG GAC GOC 



2245 



675 



680 



685 

2293 



GCT AGG GAC TCT GAG ATC GCC ATG GGC GGC TAC CAG CCA TAC CAT CTG 
Ala Arg Asp Ser Glu II. Ala Met Gly Gly Tyr Gin Pro Tyr His l,eu 



695 



700 

2341 



GCG ACC AGG CAA CCA GCC CGT GGC CAG ATC CAT GGC TTC CGG ATG GCG 
Ala Thr Arg Gin Pro Ala Arg Gly Gin He His Gly Phe Arg Met Ala 
20 715 720 

CTG TGG TAC GAG CAC CTG GGA ATG CTG GAT GAT GTG TTC CAG CGC CCC 
Leu Trp Tyr Glu His Leu Gly Met Leu Asp Asp Val Phe Gin Arg Pro 

GAG AGC CTG GAG TGT GTG CAG AAG GTG AAC AGG ATC GCG GAG AAG TAC 
Glu ser Leu Glu Cys Val Gin Lys Val Asn Arg He Ala Glu Lys Tyr 



2389 



2437 



740 



750 

2485 



TGG GAC ATG TAC TCC AGC GAC GAC CTC CAG CAG GAC CTC OCT GGC CAC 
Trp Asp Met Tyr Ser Ser Asp Asp Leu Gin Gin Asp Leu Pro Gly His 

CTC CTC AGC TAC CCC ATT GGC GTC GCC AGC GAT GGT GTG GTG ACT GAG 
Leu Leu Ser Tyr Pro He Gly Val Ala Ser Asp Gly Val Val Thr Glu 



2533 



770 



775 



780 



CTG CCC GGG ATG GAG TAC TTT CCT GAC ACA CGG GCC CGC GTC CTC GGC 2581 
785 



795 

800 



GCC AAG TCG GAT TAC ATG CCC CCC ATC CTC ACC TCA TAGACGAGGA AGCACT 2633 
Ala Lys Ser Asp Tyr Met Pro Pro He Leu Thr Ser 
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805 




810 








ACACTACAAT CTGCTGGCTT 




TCCTTCTGTA 


CTTCTTCAGT 


TTGGTGGCGA 


2693 


GATGGTATGG CCGTTGTTCA 


GAATTTPTT n 


AGAATAGCAG 


TTGTTACAGT 


TGTGAATCAT 


2753 


AAAGTAATAA GTGCAGTATC 


TGTGCATGGT 




AGAAGATCGG 


GGATGCAATG 


2813 


ATGCTTGTGA AGTTGTGATG 


CCGTTTGTAA 


GATGGGAAGT 


TGGGAACTAC 


TAAGTAATTG 


2873 


GCATGATTGT ACTTTGCACT 


ACTGTTTAGC 


GTTGTTGATA 


CTGGTTAACC 


GTGTGTTCAT 


2933 


CTGAACTTGA TTCTTGATGC 


AGTTTGTGGC 


ATTACCAGTT 


TATCATCGTT 


CTTCAGGAAA 


2993 


^^^^'^'^^^AAAAA AAAAAAAAAA 


AAAAAAAAAA 


AAAAAAAAAA 


AAAAAAA 




3040 



SEQ ID NO. :2 
SEQUENCE LENGTH: 812 
SEQUENCE TYPE: AMINO ACID 
25 MOLECULE TYPE: protein 

ORIGINAL SOURCE 

ORGANISM: Oryza sativa 
SEQUENCE DESCRIPTION 

35 ^ ^ 10 15 



Lys 

30 

40 Phe lie Arg Lys phe 

40 

45 



Ala Ala Ser Leu Ser Asn Pro His Arg Ala Ser Gly ser Ala Pro 

25 30 
Val Glu Gly He Glu Asp Thr Val Gly Val Gly 

Lys Gly Ala Thr Lys Val Tvr Ser Thr ti^ » 

lyr ser Thr lie Asp Leu Glu Lys Ala Arg 

" 70 „ 



55 
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Phe Thr Val Lys lie Asp Asn Pro lie Gly Ala Thr Asn He Gly Arg 

100 105 110 

Ala Tyr Leu Pro Val Gin Glu Leu Leu Asn Gly felu Glu He Asp Arg 

115 120 125 

Trp Leu Asp He Cys Asp Asn Asn Arg Glu Ser Val Gly Glu Ser Lys 

130 135 j^^Q 

He His Val Lys Leu Gin Tyr Phe Asp Val Ser Lys Asp Arg Asn Trp 

150 155 160 

Ala Arg Gly Val Arg Ser Thr Lys Tyr Pro Gly Val Pro Tyr Thr Phe 

165 170 175 

Phe Ser Gin Arg Gin Gly Cys Lys Val Thr Leu Tyr Gin Asp Ala His 

180 185 190 

Val Pro Asp Asn Phe He Pro Lys He Pro Leu Ala Asp Gly Lys Asn 

19S 200 205 

Tyr Glu Pro His Arg Cys Trp Glu Asp He Phe Asp Ala He Ser Asn 

210 215 220 

Ala Gin His Leu He Tyr He Thr Gly Trp Ser Val Tyr Thr Glu He 

230 235 240 

Thr Leu Val Arg Asp Ser Asn Arg Pro Lys Pro Gly Gly Asp Val Thr 

245 250 255 

Leu Gly Glu Leu Leu Lys Lys Lys Ala Ser Glu Gly Val Arg Val Leu 

260 265 270 

Met Leu Val Trp Asp Asp Arg Thr Ser Val Gly Leu Leu Lys Arg Asp 

2''5 280 285 

Gly Leu Met Ala Thr His Asp Glu Glu Thr Glu Asn Tyr Phe His Gly 
290 295 300 



16 



EP 0 685 559 A1 

Ser Asp Val Asn Cys Val Leu 



Cys Pro Arg Asn Pro Asp Asp 



Ser Gly 
320 



Ser He Val Gin Asp Leu Ser He Ser Thr Met Ph 



325 



330 



e Thr His His Gin 



335 



355 



360 



365 



Tyr A.P Thr Gin Tyr His Ser Leu Phe Arg Thr Leu Asp ser 



Thr His 



370 



375 



380 



His Asp Asp Phe His Gin Pro Asn Phe Ala Thr Ala Ser He Lys Lys 

390 :>Qs 

400 

Gly Gly Pro Arg Glu Pro Trp His Asn Tl^ «^ ^ o 

xjrp «xs Asp He His Ser Arg Leu Glu Gly 

Pro lie Ala Trp Asp Val Leu Tyr Asn Phe Glu Gin Arg Trp Arg Lys 
Gin Gly Gly Lys Asp Leu Leu Leu Gin Leu Arg Asp Leu Ser Asp Thr 
lie He Pro Pro Ser Pro Val Met Phe Pro Glu Asp Arg Glu Thr Trp 
A3n val Gin Leu Phe Arg Ser He Asp Gly Gly Ala Ala Phe Gly Phe 
Pro Asp Thr Pro Glu Glu Ala Ala Lys Ala Gly Leu Val Ser Gly Lys 
Asp Gin lie lie Asp Arg Ser He Gin Asp Ala Tyr He His Ala He 



500 



505 



510 



Arg Arg Ala Lys Asn Phe He Tyr He Glu Asn Gin Tyr Phe 



Leu Gly 
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540 

Ala Leu His Leu He Pro Lys Glu Leu Al. 



545 



550 



a Leu Lys Val Val Ser Lys 

560 

5« 

585 

«00 

S«0 685 
710 ,,5 

720 



55 
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25 



Glu Ser Leu Glu Cys Val Gin Lys Val Asn Arg He Ala Glu Lys Tyr 

740 745 750 

Trp Asp Met Tyr Ser Ser Asp Asp Leu Gin Gin Asp Leu Pro Gly His 

755 760 765 

Leu Leu Ser Tyr Pro He Gly Val Ala Ser Asp oiy Val Val Thr Glu 

^''^ 775 780 

Leu Pro Gly Met Glu Tyr Phe Pro Asp Thr Arg Ala Arg Val Leu Gly 

-^^0 795 800 

Ala Lys Ser Asp Tyr Met Pro Pro He Leu Thr Ser 
805 810 

20 

SEQ ID NO. :3 
SEQUENCE LENGTH: 2804 
SEQUENCE TYPE: NUCLEIC ACID 
MOLECULE TYPE: cDNA to mRNA 
30 ORIGINAL SOURCE 

ORGANISM: 2ea maize 
SEQUENCE DESCRIPTION 

ATCACCGTCG TCATCAATCA CGGTGACCTT GCCCTCGCTG CCCGACTGGA ACCGGACGCT 60 
GCTGCTGCTG GTAGGCTGAC AGCGAGGAGG ACGAGACGAG GGGGCC ATG GCT CAG 115 
40 Met Ala Gin 

1 

ATC TTG CTC CAC GGC ACG CTC CAC GCC ACC ATC TTC GAG GCC GAG TCG 163 
lie Leu Leu His Gly Thr Leu His Ala Thr He Phe Glu Ala Glu Ser 

CTC TCC AAC CCG CAC CGC GCC ACT GGC GGC GCC CCC AAG TTC ATC CGC 211 
Leu ser Asn Pro His Arg Ala Thr Gly Gly Ala Pro Lys Phe He Arg 
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AAG CTT GTG GAA GGG ATC GAG GAC ACC GTG GGT GTC GGC AAG GGC GCC 259 
Lys Leu Val Glu Gly He Glu Asp Thr Val Gly Val Gly Lys Gly Ala 

'° « 50 

ACC AAG ATA TAT GCC ACC GTC GAT CTC GAG AAG GCC CGT GTC GGG CGG 307 
Thr Lys He Tyr Ala Thr Val Asp Leu Glu Lys Ala Arg Val Gly Arg 

60 65 



75 



20 



35 



40 



45 



50 



ACC CGG ATG ATC TCC AAC GAG CCC GTG AAC CCT CGT TGG TAC GAG TCC 
Thr Arg Met He Ser Asn Glu Pro Val Asn Pro Arg Trp Tyr Glu Ser 

75 80 
TTC CAC ATC TAC TGC GCG CAC ATG GCC GCC GAC GTC ATC TTC ACC GTC 
Phe His He Tyr Cys Ala His Met Ala Ala Asp Val He Phe Thr Val 

25 90 95 

AAG ATC GAC AAC TCC ATC GGG GCC TCG CTC ATC GGG AGG GCC TAC TTG 
Lys He Asp Asn Ser He Gly Ala Ser Leu He Gly Arg Ala Tyr Leu 

105 115 
GCT GTC CAG GAC CTC CTG GGA GGG GAG GAG ATC GAC AAG TGG CTT GAA 
Ala val Gin Asp Leu Leu Gly Gly Glu Glu He Asp Lys Trp Leu Glu 

125 130 
ATC TCC GAT GAA AAT CGT GAG CCT GTT GGG GAC AGC AAG ATC CAT GTG 
He Ser Asp Glu Asn Arg Glu Pro Val Gly Asp Ser Lys He His Val 

135 140 145 

AAG CTC CAG TAC TTT GAC GTC GGC AAG GAC CGT AAC TGG GCG AGG GGT 
Lys Leu Gin Tyr Phe Asp Val Gly Lys Asp Arg Asn Trp Ala Arg Gly 

155 160 
GTC CGG AGC ACC AAG TAC CCT GGT GTC CCT TAC ACC TTC TTC TCG CAG 
val Arg Ser Thr Lys Tyr Pro Gly Val Pro Tyr Thr Phe Phe Ser Gin 



355 



403 



451 



499 



547 



595 
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TO 



20 



25 



165 170 175 

AGG CAG GGG TGT AAG GTT ACT CTG TAG CAG GAC GCT CAT GTG CCG GAC 691 
Arg Gin Gly Cys Lys Val Thr Leu Tyr Gin Asp Ala His Val Pro Asp 
180 185 190 195 

AAC TTT GTT CCC AGG ATC CAG CTC GCT GAT GGC AAG AAC TAT GAG CCG 739 
Asn Phe Val Pro Arg lie Gin Leu Ala Asp Gly Lys Asn Tyr Glu Pro 
200 205 210 

^5 CAC AGG TGC TGG GAG GAT ATC TTT GAT GCT ATA AGC AAG GCT CAG CAT 7 87 

His Arg Cys Trp Glu Asp lie Phe Asp Ala He Ser Lys Ala Gin His 

215 220 225 

TTG ATT TAC ATC ACT GGC TGG TCC GTG TAC ACA GAG ATC ACC TTG GTC 835 
Leu He Tyr He Thr Gly Trp Ser Val Tyr Thr Glu He Thr Leu Val 

230 235 240 

AGG GAC ACC AAC AGG CCA AAA CCT GGT GGT GAT GTT ACT CTT GGG GAG 883 
Arg Asp Thr Asn Arg Pro Lys Pro Gly Gly Asp Val Thr Leu Gly Glu 
30 245 250 255 

TTG CTC AAG AGG AAG GCC AGT GAA GGT GTC CGG GTG CTT ATG CTG GTG 931 
Leu Leu Lys Arg Lys Ala Ser Glu Gly Val Arg Val Leu Met Leu Val 
260 265 270 275 

TGG GAT GAC AGG ACT TCT GTC GGC CTG CTT AAG AAG GAT GGC TTG ATG 979 
Trp TVsp Asp Arg Thr Ser Val Gly Leu Leu Lys Lys Asp Gly Leu Met 

280 285 290 

GCT ACC CAT GAT GAG GAG ACT GCA AAT TAC TTC CAT GGC ACG GAT GTC 1027 
45 Ala Thr His Asp Glu Glu Thr Ala Asn Tyr Phe His Gly Thr Asp Val 

295 300 305 

AAC TGT GTT CTG TGC CCT CGC AAC CCT GAT GAT TCT GGC AGC TTT GTC 1075 
Asn Cys Val Leu Cys Pro Arg Asn Pro Asp Asp Ser Gly Ser Phe Val 
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75 



20 



310 315 320 

CAG GAT CTG CAG ATA TCA ACT ATG TTC ACG CAC CAC CAG AAG ATA GTA 1123 

5 

Gin Asp Leu Gin lie Ser Thr Met Phe Thr His His Gin Lys lie Val 
325 330 335 

70 GTA GTC GAC CAT GAG ATG CGG AAC CAG GGA TCC CAG CAA AGG AGG ATA 1171 

Val Val Asp His Glu Met Pro Asn Gin Gly Ser Gin Gin Arg Arg lie 
340 345 350 355 

GTC AGO TTC ATT GGT GGC ATT GAC CTT TGT GAT GGA AGA TAT GAT ACC 1219 
Val Ser Phe lie Gly Gly lie Jksp Leu Cys Asp Gly Arg Tyr Asp Thr 

360 365 370 

CAG TAG CAC TCC TTG TTC AGG ACG CTT GAC ACT GTC CAT CAC GAT GAC 1267 
Gin Tyr His Ser Leu Phe Arg Thr Leu Asp Thr Val His His Asp Asp 
25 375 380 385 

TTC CAC CAG CCG AAC TTT GAG GGT GGG TCA ATC AAG AAA GGT GGC CCA 1315 
Phe His Gin Pro Asn Phe Glu Gly Gly Ser lie Lys Lys Gly Gly Pro 

390 395 400 

AGG GAG CCA TGG CAT GAT ATC CAC TCA CGG TTG GAA GGG CCA ATC GCT 1363 
Arg Glu Pro Trp His Asp lie His Ser Arg Leu Glu Gly Pro lie Ala 

405 410 415 

TGG GAT GTT CTT TAG AAC TTT GAG CAG AGA TGG AGA AAG CAG GGT GGT 1411 
40 Trp Asp Val Leu Tyr Asn Phe Glu Gin Arg Trp Arg Lys Gin Gly Gly 

420 425 430 435 

42G GAC CTC CTT GTG CGT CTC AGG GAT CTT CCT GAC ATT ATC ATC CCC 1459 
Lys Asp Leu Leu Val Arg Leu Arg Asp Leu Pro Asp lie lie He Pro 

440 445 450 

CCT TCT CCT GTG ATG TTC CCG GAG GAC AGA GAG ACA TGG AAT GTT CAG 1507 
Pro Ser Pro Val Met Phe Pro Glu Asp Arg Glu Thr Trp Asn Val Gin 
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TO 



20 



25 



455 460 465 

CTC TTC AGA TCC ATC GAT GGT GGT GCT GCT TTT GGC TTC CCC GAG ACT 1555 
Leu Phe Arg Ser lie Asp Gly Gly Ala Ala Phe Gly Phe Pro Glu Thr 

470 475 480 

CCC GAG GAA GCT GCA AGA GCT GGG CTT GTG AGT GGA AAG GAT CAA ATC 1603 
Pro Glu Glu Ala Ala Arg Ala Gly Leu Val Ser Gly Lys Asp Gin lie 
485 490 495 

?5 ATC GAC CGG AGT ATC CAG GAT GCA TAC GTA AAC GCC ATA CGG AGG GCG 1651 

lie Asp Arg Ser lie Gin Asp Ala Tyr Val Asn Ala lie Arg Arg Ala 
500 505 510 515 

AAG AAC TTC ATC TAC ATT GAG AAT CAG TAC TTC CTT GGA AGT TCA TAC 1699 
Lys Asn Phe lie Tyr lie Glu Asn Gin Tyr Phe Leu Gly Ser Ser Tyr 

520 525 530 

GGC TGG AAG CCC GAA GGC ATC AAG CCG GAA GAA ATC GGT GCT CTT CAC 17 47 
Gly Trp Lys Pro Glu Gly He Lys Pro Glu Glu He Gly Ala Leu His 
30 5 3 5 5 4 0 5 4 5 

TTG ATT CCG AAG GAG CTC TCG CTG AAG ATT GTC AGC AAG ATT GAA GCT 1795 
Leu He Pro Lys Glu Leu Ser Leu Lys He Val Ser Lys He Glu Ala 

550 555 560 

GGG GAG CGG TTT ACT GTT TAT GTT GTG GTG CCA ATG TGG CCT GAG GGT 18 43 
Gly Glu Arg Phe Thr Val Tyr Val Val Val Pro Met Trp Pro Glu Gly 

565 570 575 

GTT CCA GAA AGC GCT TCT GTT CAG GCA ATC CTT GAC TGG CAA AGG AGA 1891 
^5 Val Pro Glu Ser Ala Ser Val Gin Ala He Leu Asp Trp Gin Arg Arg 

580 585 590 595 

ACG ATG GAG ATG ATG TAC ACT GAC ATC GCA CAA GCT CTC GAA GCC AAC 1939 
Thr Met Glu Met Met Tyr Thr Asp He Ala Gin Ala Leu Glu Ala Asn 
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GGG ATT 
^ Gly lie 

AAC CGT 

10 

Asn Arg 

75 GAA CCT 

Glu Pro 
645 

20 

ATC TAT 
lie Tyr 
660 

25 

ATT GGG 
lie Gly 

30 

TCC GAG 
Ser Glu 

35 

CAG CCT 
Gin Pro 

40 

GAG CAC 
45 Glu His 

725 
GAG TGT 

50 

Glu Cys 



600 

GAA GCA AAC CCC 
Glu Ala Asn Pro 
615 

GAG GTA AAG CAG 
Glu Val Lys Gin 
630 

GAC ACT GAT TAC 
Asp Thr Asp Tyr 

GTT CAT ACC AAA 
Val His Thr Lys 
665 

TCC GCC AAC ATC 
Ser Ala Asn lie 
680 

ATC GCC ATG GGC 
lie Ala Met Gly 
695 

GCC CGG GGC CAG 
Ala Arg Gly Gin 
710 

CTG GGA ATG CTG 
Leu Gly Met Leu 

GTG CAG AAG GTG 
Val Gin Lys Val 



605 

AAG GAC TAT CTC 
Lys Asp Tyr Leu 
620 

GAG GGA GAA TAT 
Glu Gly Glu Tyr 
635 

ATC CGG GCT CAA 
He Arg Ala Gin 
650 

ATG ATG ATA GTG 
Met Met He Val 

AAC CAG AGG TCC 
Asn Gin Arg Ser 
685 

GCG TAC CAG CCG 
Ala Tyr Gin Pro 
700 

ATC CAT GGC TTC 
He His Gly Phe 
715 

GAA GAC GTC TTC 
Glu Asp Val Phe 
730 

AAC GAG GTC GCC 
Asn Glu Val Ala 



ACT TTC TTC TGC 
Thr Phe Phe Cys 
625 

GAA. CCA GAG GAG 
Glu Pro Glu Glu 
640 

GAG GCT AGG AGG 
Glu Ala Arg TVrg 
655 

GAC GAC GAG TAC 
Asp Asp Glu Tyr 
670 

ATG GAC GGT GCC 
Met Asp Gly Ala 

TAC CAC TTG GCG 
Tyr His Leu Ala 
705 

CGG ATG TCT CTT 
Arg Met Ser Leu 
720 

CAG CGG CCC GAG 
Gin Arg Pro Glu 
735 

GAG AAG TAC TGG 

Glu Lys Tyr Trp 



610 

TTA GGT 1987 
Leu Gly 

CAC CCA 2035 
His Pro 

TTT ATG 2083 
Phe Met 

ATC ATC 2131 
He He 
675 

AGG GAC 2179 

Arg Asp 

690 

ACT AGG 2227 
Thr Arg 

TGG TAC 2275 
Trp Tyr 

AGC GTA 2323 
ser Val 

GAC CTG 2 371 
Asp Leu • 
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740 745 750 755 

TAG TCG AGC GAC GAG CTG GAG GAG GAG GTG GCG GGG GAG GTC GTC AGG 2 419 

Tyr Ser Ser Asp Asp Leu Glu Gin Asp Leu Pro Gly His Leu Leu Ser 

760 765 770 

TAG GCG ATC GGT GTG ACT GCG GAG GGG AGG GTT AGG GAG GTG CCC GGG 2 4 67 
Tyr Pro He Gly Val Thr Ala Asp Gly Ser Val Thr Glu Leu Pro Gly 

775 780 785 

ATG GAG AAC TTC CCC GAC ACC GGG GCG CGC GTG GTC GGG AAC AAG TCG 2515 
Met Glu Asn Phe Pro Asp Thr Arg Ala Arg Val Leu Gly Asn Lys Ser 

790 795 800 

GAT TAG GTC GCG CCC ATC CTG AGC ACA TAGAGTGCAC AGTGCAGGCA GCGGCAT 2569 
Asp Tyr Leu Pro Pro He Leu Thr Thr 

805 810 
GGCTGCTCTC CTGTCTGGCC TCACCTTGGT GTGGCTGTGT TTGTGTTTGG GAGACTGGAG 2 62 9 
GTTCAGATTG CAGTGTTGAT ATTATATCGC CGCTGGGTGC AGAGGGATTC GACGTTATTG 2689 
AGTCATAATA AAATGCATTG TGCACGGTGG GAGACTGGGA GGATAGGAAT TATAGTTGTT 2749 
TATTACAGTA CGACTGCTTA CTGCATCCAG ATTGTGTTGT CGCTAAATW^ AAATUV 2 8 04 



SEQ ID NO. :4 
SEQUENCE LENGTH: 812 
SEQUENCE TYPE: AMINO ACID 
MOLECULE TYPE: protein 
ORIGINAL SOURCE 

ORGANISM: Zea maize 
SEQUENCE DESCRIPTION 

Met Ala Gin He Leu Leu His Gly Thr Leu His Ala Thr He Phe Glu 
15 10 15 
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70 



20 



25 



Ala Glu Ser Leu Ser Asn Pro His Arg Ala Thr Gly Gly Ala Pro Lys 

20 25 30 

Phe lie Arg Lys Leu Val Glu Gly He Glu Asp Thr Val Gly Val Gly 

35 40 45 

Lys Gly Ala Thr Lys He Tyr Ala Thr Val Asp Leu Glu Lys Ala Arg 

50 55 60 

Val Gly Arg Thr Arg Met He Ser Asn Glu Pro Val Asn Pro Arg Trp 
?5 65 70 75 80 

Tyr Glu Ser Phe His He Tyr Cys Ala His Met Ala Ala Asp Val He 

85 90 95 

Phe Thr Val Lys He Asp Asn Ser He Gly Ala Ser Leu He Gly Arg 

100 105 110 

Ala Tyr Leu Ala Val Gin Asp Leu Leu Gly Gly Glu Glu He Asp Lys 

115 120 125 

Trp Leu Glu He Ser Asp Glu Asn Arg Glu Pro Val Gly Asp Ser Lys 
30 130 135 140 

He His Val Lys Leu Gin Tyr Phe Asp Val Gly Lys T^p Arg Asn Trp 
145 150 155 160 

Ala Arg Gly Val Arg Ser Thr Lys Tyr Pro Gly Val Pro Tyr Thr Phe 

165 170 175 

Phe Ser Gin Arg Gin Gly Cys Lys Val Thr Leu Tyr Gin Asp Ala His 

IBO 185 190 

Val Pro Asp Asn Phe Val Pro Arg He Gin Leu Ala Asp Gly Lys Asn 

195 200 205 

Tyr Glu Pro His Arg Cys Trp Glu Asp He Phe Asp Ala He Ser Lys 

210 215 220 

Ala Gin His Leu He Tyr He Thr Gly Trp Ser Val Tyr Thr Glu He 
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225 

Thr Leu Val Arg 

Leu Gly Glu Leu 
260 

Met Leu Val Trp 
275 

Gly Leu Met Ala 
290 

Thr Asp Val Asn 
305 

Ser Phe Val Gin 

Lys lie Val Val 
340 

Arg Arg He Val 
355 

Tyr Asp Thr Gin 
370 

His Asp Asp Phe 
385 

Gly Gly Pro Arg 

Pro He Ala Trp 
420 

Gin Gly Gly Lys 
435 
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230 

Asp Thr Asn Arg 
245 

Leu Lys Arg Lys 

Asp Asp Arg Thr 
280 

Thr His Asp Glu 
295 

Cys Val Leu Cys 
310 

Asp Leu Gin He 
325 

Val Asp His Glu 

Ser Phe He Gly 
360 

Tyr His Ser Leu 
375 

His Gin Pro Asn 
390 

Glu Pro Trp His 
405 

Asp Val Leu Tyr 

Asp Leu Leu Val 
440 
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235 

Pro Lys Pro Gly 
250 

Ala Ser Glu Gly 
265 

Ser Val Gly Leu 

Glu Thr Ala Asn 
300 

Pro Arg Asn Pro 
315 

Ser Thr Met Phe 
330 

Met Pro Asn Gin 
345 

Gly He Asp Leu 

Phe Arg Thr Leu 
380 

Phe Glu Gly Gly 
395 

Asp He His Ser 
410 

Asn Phe Glu Gin 
425 

Arg Leu Arg Asp 



240 

Gly Asp Val Thr 
255 

Val Arg Val Leu 
270 

Leu Lys Lys Asp 
285 

Tyr Phe His Gly 

Asp Asp Ser Gly 
320 

Thr His His Gin 
335 

Gly Ser Gin Gin 
350 

Cys Asp Gly Arg 
365 

Asp Thr Val His 

Ser He Lys Lys 
400 

Arg Leu Glu Gly 
415 

Arg Trp Arg Lys 
430 

Leu Pro Asp He 
445 
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lie lie Pro Pro 
450 

Asn Val Gin Leu 
465 

Pro Glu Thr Pro 

Asp Gin lie lie 
500 

Arg Arg Ala Lys 
515 

Ser Ser Tyr Gly 
530 

Ala Leu His Leu 
545 

lie Glu Ala Gly 

Pro Glu Gly Val 
580 

Gin Arg Arg Thr 
595 

Glu Ala Asn Gly 
610 

Cys Leu Gly Asn 
625 

Glu His Pro Glu 
Arg Phe Met lie 



Ser Pro Val Met 
455 

Phe Arg Ser lie 
470 

Glu Glu Ala Ala 
485 

Asp Arg Ser lie 

Asn Phe lie Tyr 
520 

Trp Lys Pro Glu 
535 

lie Pro Lys Glu 
550 

Glu Arg Phe Thr 
565 

Pro Glu Ser Ala 

Met Glu Met Met 
600 

He Glu Ala Asn 
615 

Arg Glu Val Lys 
630 

Pro Asp Thr Asp 
645 

Tyr Val His Thr 



Phe Pro Glu Asp 
460 

Asp Gly Gly Ala 
475 

Arg Ala Gly Leu 
490 

Gin Asp Ala Tyr 
505 

He Glu Asn Gin 

Gly He Lys Pro 
540 

Leu Ser Leu Lys 
555 

Val Tyr Val Val 
570 

Ser Val Gin Ala 
585 

Tyr Thr Asp lie 

Pro Lys Asp Tyr 
620 

Gin Glu Gly Glu 
635 

Tyr He Arg Ala 
650 

Lys Met Met He 



Arg Glu Thr Trp 

Ala Phe Gly Phe 
480 

Val Ser Gly Lys 
495 

Val Asn Ala He 
510 

Tyr Phe Leu Gly 
525 

Glu Glu He Gly 

He Val Ser Lys 
560 

Val Pro Met Trp 
575 

He Leu Asp Trp 
590 

Ala Gin Ala Leu 
605 

Leu Thr Phe Phe 

Tyr Glu Pro Glu 
640 

Gin Glu Ala Arg 
655 

Val Asp Asp Glu 
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660 

Tyr lie lie lie 
675 

Ala Arg Asp Ser 
690 

Ala Thr Arg Gin 
705 

Leu Trp Tyr Glu 

Glu Ser Val Glu 
740 

Trp Asp Leu Tyr 
755 

Leu Leu Ser Tyr 
770 

Leu Pro Gly Met 
785 

Asn Lys Ser Asp 



Gly Ser Ala Asn 
680 

Glu lie Ala Met 
695 

Pro Ala Arg Gly 
710 

His Leu Gly Met 

725 

Cys Val Gin Lys 

Ser Ser Asp Asp 
760 

Pro lie Gly Val 
775 

Glu Asn Phe Pro 
790 

Tyr Leu Pro Pro 
805 



665 

lie Asn Gin Arg 

Gly Ala Tyr Gin 
700 

Gin lie His Gly 
715 

Leu Glu Asp Val 
730 

Val Asn Glu Val 
745 

Leu Glu Gin Asp 

Thr Ala Asp Gly 
780 

Asp Thr Arg Ala 
795 

lie Leu Thr Thr 
810 



670 ■ 
Ser Met Asp Gly 
685 

Pro Tyr His Leu 

Phe Arg Met Ser 
720 

Phe Gin Arg Pro 
735 

Ala Glu Lys Tyr 
750 

Leu Pro Gly His 
7 65 

Ser Val Thr Glu 

Arg Val Leu Gly 
800 



SEQ ID NO. : 5 
SEQUENCE LENGTH: 2799 
SEQUENCE TYPE: NUCLEIC ACID 
MOLECULE TYPE: genomic DMA 
ORIGINAL SOURCE 

ORGANISM: Zea maize 
SEQUENCE DESCRIPTION 
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CAAGGGTGTA CATAGATTTG TCTCGTAAAA TAGTATTATA ATATTATAAA CTTATTACTC 60 

TATCCGTTCT AAAATATAAG AACCTTATGA CTGGATGGAA CATTTCCTAG TACTACGAAT 120 

CTGAACACAT GTCTAGATTC ATAGTACTAG GAAATGTCTC ATCGCGGTAC TAGGTTCTTA 180 

TATTTTAGGA TGGAGGGAGT TTAATATAAA ACTAATGGTT AGAACTTTGA AAGTTTGATT 24 0 

TTATSkATGTCA AATATTTATG GCTGGAGGTA GTATAATATG TTTTTTTTGG GACGTAGACT 300 

AGGTAGTATA ATATGTTTGG TTGTGTTTAG ATCCAATATT TGGATCCAAA CTTCAGTCAT 360 

TTTCCATCAC ATCAACTTGT CATATACACA TAACTTTTCA GTCACATCAT CCCCAATTTC 420 

AACCAAAATC AAACTTTGCG CTGAACTAAA CACAACCTTT GGGCCCGTTT AGTTCCCCAA 48 0 

TTTTTTTCCC AAAAACATCA CATCGAATCT TTGGACACAT GCATGAAGCA TTAAATATAG 54 0 

ATAAAAAGAA AAACTAATTG CACAGTTATG GAGGAAATCG CGAGACGAAT CTTTTAAGCC 600 

TAATTAGTCC GTGATTAGCC ATAAGTGCTA CAGTAACCCA ATTGTGCTAA TGACGGCTTA 660 

ATTAGTCTCC ACAAGATTCG TCTCGCAGTT TCCAGGCGAG TTCTGAAATT AGTTTTTTCA 720 

TTCGTGTCCG AAAACCCCTT CCGACATCCG GTCAAACGTT CGATATGACA CCCACAAATT 780 

TTCTTTTCCC CAACTAAACA CACCCTTTAT CTCTTACCCT CTGGCTCTTT CAGTAGGCAT 840 

ATCCAAGACA GCTGGTAATG CAGGCTCGGA CATAATTTGA CAGTTACGTT CATGTGACCG 900 

ACGGTTGATG CTAGTGCAAC TGCAACATAC TGTTCAGATG GATGTCCCfiA CGAGCTCAAA 960 

ACAACTTAGG TGGCGCGTCG CGATTCATCA ATAACTCAAA TGGAAGCGCA AGTGCACGTA 1020 

CGAAAATGAC AGCGAGTGAG GTGGCGAGCC TCACCTTGGT GATCCCAACC GGATAAGCTA 1080 

TGCATCAGCC AGTTTCGTGG GGCTGCACAT TTCGTCGAAC ACCTGGAGTC CACGCCGCCG 1140 

GCGACGTCGG CACAGCGCGC CCGCCCACCG CCCACGCACG CGCTTGACTC CACCCATGTT 1200 

CTCCCTTCTC GACGCCCGCG AAGCCAGCGA ACCGATCCGA GGAAGTCAAG CCCCCACCGC 1260 

CACTTGGACC GACCTCGGGA CGACGACGCC CCCGCGCTCT TCTAGACGCG CGGACGACGC 1320 

GGGCGCTGGC TCCGCGACGC GACGTCGCGG TCATGGAGTA ACCGCGACGG ACAGATACTT 1 3 8 0 

CTACCCGTTT TTAACCTCGC CTCCTCCTCC TCCCGGCTCG AGAtCCGTGG CCACGACGCG 1440 

TGGTGGGAAA CCGGGAACGA CGTGCACGCA CGCACACAGG GCAAGTTTCA GTAGAAAAAT 1500 

CGCCGGCATC CAGATCGGGA CAGTCTCTCT TCTCCCGCAA TTTTATAATC TCGCTCGATC 1560 

CAATCTGCTC CCCTTCTTCT TCTACTCTCC CCATCTCGGC TCTCGCCATC GCCATCCTCC 1620 
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TCTCCCTTCC CGGAGAAGAC GCCTCCCTCC GCCGATCACC ACCCGGTAAG CCCAGTGTGC 168 0 

TTAGGCTAAG CGCACTAGAG CTTCTTGCTC GCTTGCTTCT TCTCCGCTCA GATCTGCTTG 17 4 0 

CTTGCTTGCT TCGCTAGAAC CCTACTCTGT GCTGCGAGTG TCGCTGCTTC GTCTTCCTTC 18 00 

CTCAAGTTCG ATCTGATTGT GTGTGTGGGG GGGCGCAGGT AGGGCGAGGA GGGAGCCAAA 18 60 

TCCAAATCAG CAGCC ATG GCG CAG ATG CTG CTC CAT GGG ACG CTG CAC GCC 1911 
Met Ala Gin Met Leu Leu His Gly Thr Leu His Ala 
lb 10 

ACC ATC TTC GAG GCG GCG TCG CTC TCC AAC CCG CAC CGC GCC AGC GGA 195 9 
Thr lie Phe Glu Ala Ala Ser Leu Ser Asn Pro His Arg Ala Ser Gly 

15 20 25 

AGC GCC CCC AAG TTC ATC CGC AAG GTTCGGACCC TTCTCCTTAA TCTACTCGTC 2013 
Ser Ala Pro Lys Phe lie Arg Lys 

30 35 

TTTGCTCTTG CTCTTTTTCT TTTGTGTCCC TTTCTTGTGT GTGCGTTTGC ATGAGCCCGA 2073 

ATTTGATCTG CTAGTGCACA GTACAGTCAG ATACACTGAA ACGATCTGGA AATTCTGGAT 2133 

TATTAGGAAA AATAAAGAGG TAGTAGACAA GAATTGGAGA TACTTTCTAT CAAGATTGGT 2193 

CTATTATGCT TGGCCATTTC TTGTTTGACC CAAGTACTTC TTTGAATCTA GAGTTTGCTG 2253 

TGTGTGATGT GGTGTGTGTT TGTGTCACCA AAAATCTTCA TTAGCTAAAA CTGAAATTTT 2313 

ATTTATTAAC TGACCTACTA AAAATGTAGA GTTCTCTGTG TGTGATGTGT GCTTGTGTCA 2373 

CCAAAAATCT TGATTTGATA GAQTTTTTAT TTATTTATTA ACTGACCTAC TACTWITCTA 2433 

TTGCTGTATG CTATGTGTGT CTGTATCACC TGAAATGCAA TGTCTTCTTC TTTGTTGTTC 2 493 

TTGATCTAAC ACGTGAGCTC ATGTCAACAG TTT GTG GAG GGG ATT GAG GAC ACT 2547 

Phe Val Glu Gly lie Glu Asp Thr 
40 

GTG GGT GTC GGC AAA GGC GCC ACC AAG GTG TAT TCT ACC ATT GAT CTG 2595 
Val Gly Val Gly Lys Gly Ala Thr Lys Val Tyr Ser Thr lie Asp Leu 
45 50 55 60 
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GAG AAA GCT CGT GTA GGG CGA ACT AGG ATG ATA ACC AAT GAG CCC ATC 2643 
Glu Lys Ala Arg Val Gly Arg Thr Arg Met lie Thr Asn Glu Pro lie 
65 70 75 

AAC CCT CGC TGG TAT GAG TCG TTC CAC ATC TAT TGC GCT CAT ATG GCT 2691 
Asn Pro Arg Trp Tyr Glu Ser Phe His He Tyr Cys Ala His Met Ala 
80 85 90 

TCC AAT GTG ATC TTC ACT GTC AAG ATT GAT AAC CCT ATT GGG GCA ACG 2739 
ser Asn Val He Phe Thr Val Lys He Asp Asn Pro He Gly Ala Thr 
95 V 100 105 

AAT ATT GGG AGG GCT TAG CTG CCT GTC CAA GAG CTT CTC AAT GGA GAG 27 87 
Asn He Gly Arg Ala Tyr Leu Pro Val Gin Glu Leu Leu Asn Gly Glu 
110 115 120 

GAG ATT GAG AGA 
Glu He Asp Arg 

30 



20 



25 



2799 



35 



40 



45 



Claims 

1. A cloned DNA which encodes phospholipase D originated from a plant. 

2. The DNA according to claim 1, which encodes phospholipase D originated from a monocotyledonous 
plant. 

3. A cloned DNA which encodes an amino acid sequence shown in SEQ ID NO. 2 or an amino acid 
sequence having the same sequence as shown in SEQ ID NO. 2 except that one or more amino acids 
are added, deleted or substituted, said amino acid sequence giving enzyme activity of phospholipase 

4. The DNA according to claim 3. which encodes the amino acid sequence shown in SEQ ID NO. 2. 

6. A cloned DNA which encodes an amino acid sequence shown in SEQ ID NO. 4 or an amino acid 
sequence having the same sequence as shown in SEQ ID NO. 4 except that one or more amino acids 
are added, deleted or substituted, said amino acid sequence giving enzyme activity of phospholipase 

6. The DNA according to claim 5, whclh encodes the amino acid sequence shown in SEQ ID NO. 4. 

55 7. The DNA according to claim 3, which has a nucleotide sequence shown in SEQ ID NO. 1 or has the 
same nucleotide sequence as shown in SEQ ID NO. 1 except that one or more nucleotides are added 
deleted or substituted, said nucleotide sequence encodes an amino acid sequence giving enzyme 
activity of phospholipase D. 
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8- The DNA according to claim 7. which has a nucleotide sequence shown in SEQ ID NO. 1 . 

9. The DNA according to claim 5. which has a nucleotide sequence shown in SEQ ID NO. 3 or has the 
same nucleotide sequence as shown in SEQ ID NO. 3 except that one or more nucleotides are added, 
deleted or substituted, said nucleotide sequence encodes an amino acid sequence giving enzyme 
activity of phospholipase D. 

10. The DNA according to claim 9. which has a nucleotide sequence shown in SEQ ID NO. 3. 

11. The DNA according to claim 4, which has a nucleotide sequence from 182th to 2617th nucleotide in the 
nucleotide sequence shown in SEQ ID NO. 1 . 

12. The DNA according to claim 6. which has a nucleotide sequence from 107th to 2542th nucleotide in the 
nucleotide sequence shown in SEQ ID NO. 3. 

13. A cloned DNA which regulates expression of phospholipase D gene originated from a plant. 

14. A cloned DNA which has a nucleotide sequence shown in SEQ ID NO. 5 or has the same nucleotide 
sequence as shown in SEQ ID NO. 5 except that one or more nucleotides are added, deleted or 
substituted, said nucleotide sequence regulates expression of the DNA encoding an amino acid 
sequence giving enzyme activity of phospholipase D. 

15. The DNA according to claim 14, which has a nucleotide sequence shown in SEQ ID NO. 5. 

16. The DNA according to claim 5, which has a nucleotide sequence from 1st to 1875th nucleotide in the 
nucleotide sequence shown in SEQ ID NO. 5. 
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